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ABSTRACT 


This  document  is  the  final  report  for  a  research  project  sponsored  by 
the  United  States  Army  to  study  the  distribution  of  functionality  within  a 
distributed  system  so  as  to  provide  the  system  with  graceful  degradation. 
In  addition,  overall  system  capabilities  were  studied  in  an  effort  to 
describe  system  performance  as  system  resources  were  alleviated  due  to 


failure. 


1.0  INTRODUCTION 


The  currently  existing  systems  employed  by  the  military,  which  consist 
of  a  function  or  functions  residing  on  a  single  processing  unit,  fail  in 
the  worst  possible  way  in  the  field.  That  is,  when  the  processing  unit  is 
destroyed,  the  function  ceases  to  exist.  This,  coupled  with  the 
impossibility  of  reverting  to  a  manual  method  -  since  much  of  the  doctrine 
for  the  function  becomes  embedded  in  the  automated  system  -  leads  to  an 
intolerable  situation  in  the  field.  In  essence,  the  operational  capability 
of  a  unit  can  be  destroyed  by  destroying  its  computer. 

A  method  of  supplying  systems  which  differ  in  a  fundamental  way  from 
the  current  "point  system”  approach  was  studied.  This  method  is  based  upon 
a  combination  of  distribution  of  the  functionality  of  a  system  across  a 
network  of  processing  units  and  provisions  of  each  function  (and  the 
subfunctions  of  which  it  is  made  of)  in  a  number  of  locations  in  the 
network  with  a  number  of  levels  of  functionality.  In  operation,  the 
request  for  a  function  causes  the  best  instance  of  that  function  currently 
available  to  the  requesting  processor  to  be  invoked.  Graceful  degradation 
may  be  provided  by  making  this  best  function  dynamic  in  the  sense  that, 
depending  upon  the  currently  existing  system  configuration,  the  function 
can  be  rescheduled  if  it  was  lost  during  the  previous  scheduling  attempt  or 
if  the  scheduled  hardware  is  currently  unavailable.  Clearly,  such  an 
approach  would  ensure  the  availability  of  computer  capability  over  a  more 
extended  period  of  time  than  the  single  stand-alone  computers  presently 
being  employed.  Further,  there  are  other  advantages  to  utilizing  such  a 
distributed  system.  First,  a  more  powerful  capability  can  be  made 


available  by  Interconnecting  several  computer  systems  together  in  a 
distributed  fashion.  Secondly,  the  computer  systems  used  need  not  be  all 


that  expensive  since  the  distributed  system  will  require  that  functions  be 
segmented  in  such  a  way  that  they  can  run  on  a  less  powerful  and  less 
expensive  machine.  Thirdly,  since  the  system  modules  will  be  assumed  to  be 
identical  for  this  research  (but  in  reality  may  or  may  not  be),  it  is  much 
easier  and  more  economical  to  replace  any  faulty  or  destroyed  modules. 


2.0  RESEARCH  DESCRIPTION 

In  recent  years  distributed  processing  systems  have  been  a  subject  of 
interest  due  to  the  availability  of  computer  networks  and  the  availability 
of  microprocessors  for  use  in  inexpensive  distributed  computers. 
Therefore,  this  study  attempts  to  develop  a  scheduling  algorithm  for  a 
distributed  system,  to  improve  the  total  system  performance  and  to 
effectively  utilize  all  system  resources.  The  objective  of  the  scheduling 
algorithms  is  to  minimize  the  processing  time  of  the  system,  balance  the 
load  among  the  available  processors  and  to  increase  the  efficiency  of  the 
total  system.  This  study  also  attempts  to  develop  a  rescheduling  algorithm 
which  provides  a  means  of  regaining  a  function  when  it  has  been  lost  due  to 
loss  of  a  particular  constraint  attached  to  the  distributed  systems.  The 
introduction  of  the  microprocessor  has  made  distributed  processing  an 
increasingly  popular  notion  in  the  computer  industry.  Economics  of 
fabrication  have  substantially  reduced  the  cost  of  replacing  processors  in 
a  system,  making  distributed  processor  systems  economically  attractive. 
Microprocessors  have  created  an  environment  that  is  fostering  the  growth  of 
distributed  computation. 


By  a  distributed  computer  system  in  which  there  exist  several 
progammable  processors,  and  inwhich  typical  computations  visit  two  or  more 
processors  during  an  execution.  The  distributed  programs  that  we  consider 
in  our  analyses  are  assumed  to  be  made  up  of  modules  that  are,  in  general 
faceed  to  reside  on  any  processor  if  it  is  not  an  attached  module,  in  the 
distributed  system. 

Distributed  computer  systems  appear  to  offer  extensibility 
improvements  over  these  configurations  due  to  decentralization  of  the 
interconnection  and  control  logic  (both  hardware  and  software).  As  the 
system  is  scaled  up  in  size,  nonlinearities  and  boundary  conditions  in 
performance  are  less  likely  than  for  centralized  systems. 

2.1  The  Problem 

The  existing  systems  employed  by  the  military,  consist  of  a  function 
or  functions  residing  on  a  single  processing  unit  with  the  result  that  when 
the  processing  unit  is  destroyed,  the  function  ceases  to  exist.  This 
coupled  with  the  impossibility  of  reverting  to  a  manual  method-since  much 
of  the  doctrice  for  the  function  becomes  embedded  in  the  automated 
system-leads  to  an  intolerable  situation  in  the  field. 

A  method  of  supplying  systems  which  differ  in  a  fundamental  way  from 
the  current  "point  systems"  approach  is  proposed  for  investigation. 

This  method  is  based  upon  a  combination  of  distribution  of  the 
functionality  of  a  system  across  a  network  of  processing  units  and 
provisions  of  each  function  (and  the  sub-functions  of  which  it  is  made  of) 
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in  a  number  of  locations  in  the  network,  with  a  number  of  levels  of 
functionality.  In  operation,  the  request  for  a  function  causes,  the  best 
instance  of  that  function  currently  available  to  the  requesting  processor 
to  be  invoked.  Depending  upon  the  currently  existing  system  configuration, 
the  function  can  be  rescheduled  if  it  was  lost  during  the  previous 
scheduling  attempt  or  if  the  schedule  hardware  is  currently  unavailable. 
Clearly,  such  an  approach  would  ensure  the  availability  of  computer 
capability  over  a  more  extended  period  of  time  than  the  single  stand-alone 
computers  presently  being  employed. 

The  proposed  model  of  the  distributed  system  to  be  considered  in  this 
research  is  shown  in  Fig.  2.1.  Although  the  system  is  depicted  with  only 
five  modules  and  3  processors  there  may  be  many  more,  or  even  less  modules. 
Each  module  is  defined  to  have  a  set  of  resources,  such  as  CPU  speed, 
memory  size,  computational  power,  etc.  These  resources  may  or  may  not  be 
the  same  for  each  module.  The  solid  arrows  indicates  the  communications 
link  over  which  programs  and  data  are  transferred  between  the  various 
modules.  The  dashed  arrows  indicate  that  a  communications  link  could 
possibly  exist  between  the  modules  as  shown. 

As  stated  earlier,  the  number  of  modules  and  the  exact  interconnection 
of  these  modules  may  vary  depending  upon  the  configuration  that  supports 
the  most  efficient  scheduling  algorithm.  Therefore,  in  the  project  we 
investigated  and  developed  a  heuristic  that  will  schedule  functions  to 
modules  subject  to  constraints  that  are  well  defined. 
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2.2  Brief  Review  of  Literature 

|  Since  the  mid-1960's  several  Investigations  have  been  done  in  the  area 

of  distributed  computer  systems.  Distributed  systems  has  received 
Increased  attention  In  the  recent  literature.  Some  of  this  work  will  be 
discussed  briefly  next. 

Harold  S.  Stone  [5]  has  shown  how  the  program,  modules  of  a  program 
may  be  assigned  to  the  processors  in  a  distributed  computer  system  so  as  to 
minimize  the  overall  cost,  including  two  types  of  cost:  the  cost  of 

running  an  individual's  module  on  a  processor  and  the  cost  of 
interprocessor  communication  that  arises  in  the  event  of  transfer  of 
control  of  execution  from  one  processor  to  another. 

V.  B.  Gylys  and  J.  A.  Edwards  [4]  introduced  a  performance  measure 
for  a  real  time-distributed  network  and  discussed  computational  techniques 
for  obtaining  optimal  work  load  partitioning  over  a  network  configuration. 
Optimality  is  attained  by  the  assignment  of  programs  of  computers  which 
minimizes  the  intercomputer  bus  traffic,  subject  to  constraints  on  the 
maximum  loading  of  each  computer.  This  principle  could  be  used  to 
J  determine  workload  partitioning  both  at  design  and  in  real-time; 

i  furthermore,  at  design  time,  it  can  also  be  applied  to  finding  optimal 

;  network  configuration  for  a  given  software  design.  Gylys  and  Edwards, 

proceeded  with  formulation  of  an  optimally  criterion  for  workload 
distribution  and  derived  a  mathematical  optimization  problem;  subsequently 
it  examined  the  computational  techniques  for  solving  that  particular 
problem;  it  ended  with  a  critical  assessment  of  the  proposed  method. 

t 

! 

I 

i 
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The  problem  was  to  find  an  assignment  of  program  modules  to  the 
processors  in  the  network.  We  researched  Co  make  such  an  assignment 
efficient  by  using  the  method  of  Ford  and  Fulkerson  [8]  that  has  been 
developed  for  maximizing  flows  in  commodity  networks.  The  maximum  flow 
algorithm  was  extended  to  solve  the  multi-processor  models.  This  is  to  say 
that  the  value  of  a  maximum  flow  in  a  commodity  network  is  equal  to  the 
weight  of  a  minimum  weighted  cutset  of  the  network.  A  cutset  of  the 
commodity  network  is  a  set  of  edges  which  when  removed  disconnects  the 
source  nodes  from  the  sink  nodes.  This  is  explained  in  detail  in  Chapter 
II. 


W.  W.  Chu,  L.  Y.  Holloway,  M.  J.  Lan,  and  K.  Efe,  [10] 
concentrated  on  the  problem  of  task  allocation  in  distributed  data 
processing.  A  distributed  processing  system  has  conflicting  requirements 
and  this  paper  therefore,  made  a  compromise  to  find  the  optimum  assignment 
policy  for  a  task.  Different  approaches  for  solving  the  assignment 
problems  have  been  surveyed.  All  of  the  possible  methods  for  partitioning 
a  task  have  not  yet  been  fully  investigated,  although  some  promising 
attempts  have  been  reported. 

V.  Balanchandra ,  J.  W.  McGredie,  and  0.  I.  Mikhai  [1] 
investigated  the  job  assignment  problems  in  a  network  of  non-identical  but 
functionally  similiar  computers.  Periodic  review  models  are  formulated 
utilizing  (0-1)  integer  programming,  network  flow  algorithms, 
transportation  problems  and  heuristic  balancing  procedure.  They 
investigated  the  power  of  each;  to  determine  what  type  of  Information 
about  job  requirements  is  needed;  to  compare  the  processing  requirements 
and  the  quality  of  the  solution  for  each  formulation. 


10 


Edward  K.  Bowdoa,  Sr.  [3]  has  done  research,  aimed  at  developing 
analytical  tools  for  system  modeling  and  analysis  of  real-time  computer 
networks.  He  formulated  an  Idealized  mathematical  model  for  multiserver 
systems  with  a  finite  length  nonpreemptive  priority  queue.  Given  that  jobs 
consist  of  dependent  tasks  having  linear  loss  functions,  Bowdon  formulated 
an  algorithm  for  assigning  priorities  to  tasks.  He  defines  a  feasible 
successor  set  of  tasks  as  a  subset  of  tasks  which  can  be  scheduled 
independently.  Each  task  is  weighted  by  the  maximum  cost  rate  per  task 
over  every  feasible  successor  set  of  the  task  and  the  task  set  is  divided 
into  levels  based  on  the  precedence  relationship  among  the  tasks. 
Generally,  the  algorithm  gives  priority  to  tasks  within  a  given  level. 
This  algorithm  is  in  general,  subopt imal. 

(Cennal  Efe  [7]  extended  the  work  done  by  Chu,  Holloway,  Lan  and 
himself  to  find  a  heuristic  for  task  allocation  In  a  distributed  system. 
The  purpose  of  his  study  for  task  allocation  scheduling  in  a  set  of 
interconnected  processors  was  to  reduce  job  turnaround  time.  This  was  done 
by  minimizing  any  communication  between  processors.  A  distributed 
processing  system  has  conflicting  requirements;  therefore  some  compromises 
were  made  in  order  to  find  the  optimal  assignment  policy  for  a  task. 


3.0  THE  ASSIGNMENT  OF  MODULES  TO  PROCESSORS 


3 . 1  Development  of  Model 


Distributed  processing  enhances  systera(s)  performance  by  employing 
several  processors  to  handle  the  processing  load.  A  representation  of  the 
distributed  processing  system  is  shown  in  Fig.  3.1.  The  key  elements  in 
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this  system  are  a  set  of  modules  to  be  processed  {ij  , .  and  a 

module  allocation  or  a  module  assignment  mechanism,  C,  which  assigns  each 
of  the  i  modules  to  one  of  the  n  processors,  (S^,  In  general, 

the  number  of  modules  is  much  higher  than  the  number  of  processors.  There 
can  be  a  mechanism  where  the  processors  in  this  environment  communicate 
among  themselves  via  interconnection  mechanism  shown  in  Fig.  3.1.  Modules 
may  be  assigned  to  different  processors  for  the  fastest  processing  time. 

The  intermodule  communication  between  any  pair  of  modules  is 
determined  by  software  design  and  fixed  attribute  of  the  modules  at  the 
time  of  module  assignment.  We  have  to  assign  modules  to  processors  so  that 
all  processors  are  approximately  evenly  loaded.  The  example  of  the  ideal 
balanced  assignment  strategy  is  exhibited  in  Fig.  3.2  where  six  modules 
have  to  be  processed  by  three  processors,  i.e.,  these  are  the  modules  {ij, 
i2,...ig}  which  are  to  be  assigned  among  the  three  processors  (3j,  S2 ,  S3}. 
We  assume  each  module  has  identical  processing  requirements  and  processing 
time,  and  that  each  processor  has  identical  processing  abilities.  For 
simplicity,  let  us  also  assume  that  the  processing  time  of  each  processor 
is  one  minute  per  module,  i.e.,  for  the  case  Illustrated  In  Fig.  3.2,  the 
system  Is  able  to  process  the  module  assignment  in  two  minutes. 

Figure  3.2  exhibits  the  following: 

S3  will  process  ij  and  i^ 

52  will  process  ^  and  ij 

53  will  process  13  and  ig 

Thus  the  required  total  time  equals  two  minutes. 
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the  objective  Is  to  distribute  modules  In  a  manner  such  that  maximum 
number  of  modules  can  be  processed  simultaneously  for  the  maximum  system 
performance . 

The  method  used  Is  a  graphical  method  where  modules  to  be  assigned  are 
like  a  set  of  nodes  In  a  network.  We  assume  the  inter-module  communication 
time  between  the  modules  are  known,  represented  by  the  time  unit  or  by  the 
weight  of  undirected  arcs  connecting  the  nodes.  An  inter-module  or  module 
to  processor  communication  of  zero  means  that  no  communication  takes  place 
between  the  two  modules  or  between  the  module  to  processor.  They  are 
connected  in  the  network  to  show  that  there  could  be  communication.  An 
inter-module  or  module  to  processor  communication  time  of  infinity  means 
these  modules  should  not  be  processed  by  that  particular  processor.  We 
also  assume  that  all  processors  are  ready  and  available  at  all  times  for  an 
assignment. 

Therefore,  the  assignment  strategy  in  this  model  is  to  minimize  total 
processing  time  defined  as  the  sum  of  the  processing  time  with  respect  to 
the  module  connection  to  the  processor.  In  order  to  represent  the 
inter-module  communication  and/or  module  to  processor  assignment,  we 
propose  the  following  assignment  model. 

3.1.1  Assignment  Model 


Given  the  network  conf iguration  the  objective  is  to  assign  modules  to 
processors  with  the  objective  of  minimizing  total  time  where  total  time  is 
the  running  time  of  modules  in  the  processors  plus  the  intercommunication 
time  among  the  modules.  The  problem  can  be  formulated  as  a  linear 


programming  problem  as  following: 


Define  X-y  =  1  if  module  i  is  assigned  to  processor  j,  0  otherwise. 


Let  t^j  =  running  time  of  module  i  in  processor  j 
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+  \ 

I 

jeJ 

I 

^k1*  yiki£j 

subject  to 

m 

(2) 

I 

Xij 

>  l 

for  all 

jeJ 

i=l 

(3) 

n 

L 

XU 

=  l 

for  all 

iel 

J-l 

(4) 

X,  ■: 

ik3 

- 

i^j 

+  "ikHj 

i  ‘  Wll  * 

0  for  all  (i^.  i^)  and  j 

X±j  >  0  for  all  iel  and  jeJ 


The  above 

module  assigns  at  least 

one  module  to 

each 

processor 

with 

the  objective 

of 

minimizing  total 

running  time. 

If 

there 

exists  no 

constraint  as 

to 

the  utilization 

of  processors 

(i.e. 

»  not 

all 

the 
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processors  need  to  be  used),  then  one  can  simply  remove  the  first 
constraint  set  from  the  above  model  before  solving  the  problem. 

To  clarify  the  procedure,  the  following  simple,  five  modules,  three 
processors  problem  is  modeled  as  an  assignment  problem  and  solved  using  the 
available  LINDO  (Ref.  9)  package  program.  (See  Appendix  I). 

Tables  I  and  II  give  the  intermodule  communication  times  and  module  to 
processor  communication  times,  respectively. 

The  optimal  assignment  as  read  from  the  computer  output  is  as  follows: 


Module 

A 

B 

C 

D 

E 


Processor 

1 

2 

3 

3 

3 


Total  running  time  -  33  time  units- 


♦ 
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I 

I 

I 


TABLE  I.  INTER  MODULE  COMMUNICATION  TIME 


MODULE 

A 

B 

C 

D 

E 

A 

2 

3 

3 

_ 

B 

2 

2 

1 

C 

2 

7 

D 

E 

TABLE  II. 

MODULE  TO  PROCESSOR 

COMMUNICATION  TIME 

MODULE 

SI 

S2 

S3 

A 

3 

12 

15 

B 

10 

4 

12 

C 

12 

14 

4 

D 

10 

7 

5 

E 


4 


7 


3 
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3.2  Extensions  to  Model 

The  above  model  assigns  modules  to  processors  with  the  objective  of 
minimizing  total  running  time.  It  assumes  that: 

(a)  Each  processor  must  have  at  least  one  module  assigned  to  it. 

(b)  Each  processor  has  enough  capacity  to  handle  all  the  modules 
assigned  to  it. 

(c)  There  exists  only  one  of  each  type  of  module  in  the  system. 

(d)  Workload  of  the  processor  is  not  an  issue. 

The  above  assumptions  can  be  relaxed  as  the  assignment  model  can  be 

modified  to  incorporate  the  changes.  To  relax  the  first  assumption,  one 

simply  removes  (2)  which  is  the  first  constraint  set  from  the  model.  To 

relax  the  second  assumption,  one  needs  to  introduce  a  new  constraint  set 

indicating  that  the  memory  space  needed  by  the  modules  assigned  to  a 

specific  processor  must  not  exceed  the  total  available  memory  space  in  the 

processor.  Let  mi  denote  the  memory  space  required  by  module  l  and  Mj 

denote  the  total  amount  of  available  space  in  the  jth  processor.  Then, 
n 

Y  mi  Xj*  <  Mi  for  all  jeJ. 

i=l  J 

It  should  be  noted  that  the  inclusion  of  this  constraint  set  to  the 
existing  model  destroys  the  topology  of  the  model.  One  can  not  now 

guarantee  integer  solutions.  Therefore,  one  needs  to  use  some  other 
solution  methods,  like  integer  programming  methods,  to  restrict 
variables  to  have  values  of  0  or  1,  which  decrease  the  efficiency  of  the 
model  considerably.  A  similiar  problem,  called  the  job  scheduling  problem, 
with  no  communication  between  the  jobs  has  been  efficiently  solved  as  the 


knopsach  problem  and  presented  at  the  ORSA/TIMS  San  Diego  Conference 
(October  1982).  The  existence  of  Intermodule  communication  in  our  model 
prevents  us  to  use  the  above  mentioned  algorithm. 

The  third  assumption  is  the  subject  of  the  proceeding  section.  The 
last  assumption  can  be  relaxed  by  adding  another  set  of  constraints  into 
the  model  which  turn  decreases  the  efficiency  of  the  proposed  solution.  A 
more  realistic  approach  would  be  to  get  the  assignments  neglecting  the 
workloads  and  then  reschedule  modules  to  processors  in  a  way  to  balance  the 
workload  with  the  objective  of  minimizing  the  increase  in  the  total 
processing  time. 


3.3  Assignment  of  Additional  Modules 

The  utilization  rate  of  modules  plays  an  important  role  in  the 
scheduling  process.  Scheduling  highly  used  modules  to  one  processor 
increase  the  queue  length  for  that  procesor  and  in  turn  decreases  the 
realiability  of  the  whole  system.  Duplicates  of  the  module  must  be 
assigned  to  other  processors  to  resolve  this  issue.  How  many  of  each 
module  and  to  which  processors  to  be  assigned  are  the  concern  of  this 
section. 

Let  P^  denote  the  utilization  rate  of  module  i,  which  is  a 
predetermined  number.  The  expected  number  of  module  i,  E(i),  can  be 
calculated  by  multiplying  the  utilization  rate  of  module  i  by  the  available 
number  of  processors.  That  is, 

E(i)  »  Pi  x  n 


The  procedure  that  we  propose  Is  the  following:  neglecting  the  extra 
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modules.  Solve  the  assignment  problem  as  explained  in  section  2.1  and  get 
the  initial  assignments.  Then,  calculate  E(i)  for  each  model.  The 
assignment  of  extra  modules  to  the  processors  will  be  obtained  by  solving 
the  following  linear  programming  model. 


Minimize 

I  L 

iei  jeJ 

t, .  X 

ij  ij 

Subject  to 

II 

••n 

•H 

X 

E(i)  -  1 

for  all  i 

XU  = 

0  if 

module  i  is  assigned  to  processor 
j  in  the  previous  assignment  problem 

Xij  ± 

1  for 

all  i  and  j. 

where  tj  is  the  processing  time  of  module  l  in  processor. 


The  above  program  assigns  nne  extra  modules  to  processors  with  the 
objective  of  minimizing  total  processing  time. 


Table  III  gives  the  utilization  probability  for  each  module  and  the 
expected  number  of  modules  required  from  each  module  for  the  problem  given 
in  Table  IV  and  is  taken  to  be  the  smallest  integer  greater  than  or 


I 


equal  t^  the  product  x  n. 
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TABLE  III 


MODULE  Pi  Ei 


A  .2  1 

B  .5  2 

C  .8  3 

D  .4  2 

E  .7  3 


The  assignment  model  and  the  solution  for  the  above  problem  is  given  in 
Appendix  II.  Table  IV  summarizes  the  solution. 


TABLE  IV 


MODULE 


PROCESSOR  (S)  TIME 

it  OF  MODULES  ASSIGNED  REQUIRED 


A 

1 

Si 

3 

B 

2 

Si ,  s2 

14 

C 

3 

Sj,  s2,  s3 

30 

2 
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3 . 4  Scheduling  Modules  To  Processors  Using  Network  Flow 

Approach 


There  exists  a  close  relationship  between  assignment  and  network  flow 
models.  Simply,  one  is  the  dual  of  the  other.  Therefore,  the  scheduling 
problem  can  be  attacked  as  a  network  flow  problem  in  the  following  way. 

Let  the  processors  and  modules  be  the  nodes  of  the  network.  A  set  of 
arcs  connect  modules  to  modules  and  modules  to  processors.  If  arc  (i,j) 
connect  module  i  to  module  j  then  t^  indicate  the  intercommunication  time 
between  modules  i  and  j.  On  the  other  hand,  if  arc  (i,j)  connects  module  i 
to  processor  j  then  t^j  is  calculated  as 


t 


ij 


I 

keJ 


'ik 


-  t 


IL 


n-l 


that  is,  t^j  is  the  sum  of  processing  times  of  module  i  in  the  processors 
other  than  j  less  the  processing  time  of  module  i  in  processor  j  divided  by 
number  of  processors  minus  one.  Figure  3.3  is  an  example  to  such  network 
configurations.  The  problem  of  assigning  modules  to  processors  with  the 
objective  of  minimizing  total  time  can  now  be  translated  to  the  problem  of 
finding  a  raulttcut  with  minimum  value.  The  set  of  modules  in  the  subset  of 
a  processor  will  be  assigned  to  that  processor. 


The  procedure  for  n  processor  m  module  problem  can  be  summarized  as 


follows : 
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Figure  3.3  An  example  to  the  network  representation 
of  the  problem. 


Definition:  A  multicut  partitions  the  graph  into  n  disjoint  sets  where 
each  set  contains  one  and  only  one  processor.  No  proper 
subset  of  this  cut  is  also  a  multicut. 

Figure  3.4  is  an  example  of  a  multicut  for  the  given  networ. 


processor 

module 


Figure  3.4  Example  of  a  multicut. 
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1.  Construct  the  general  network.. 

2.  For  each  processor  S^,  find  the  maximum  flow  from  processor 

to  all  other  processors.  Store  the  minimal  cut.  At  this 
step  n  maximum  flow  problems  will  be  solved  and  n  minimum 
cuts  will  be  located. 

3.  Form  n-1  multicuts. 

4.  The  multicut  with  the  minimal  value  generates  the  desired 
assignment . 


3.4.1  Flowchart  For  The  Algorithm 

I.  INITIALIZATION 

Given  the  network  configuration,  read  and  store  the  data. 

Set  MIN  =0,  KK  =  0 

II.  MAXFLOW 

For  each  processor  Si:  Maximize  the  flow  from  3^  to  all  other 

processors  (LINDO  software  package  is  used  at  this  step).  Retain  the 
minimal  cut,  Cj  . 

III.  MULT I CUT 

Using  the  n  minimal  cuts,  (C's)  generate  (n-1)  multicut.  TTj 
denotes  the  value  of  the  minimal  cut  C^.i.e.,  the  time  of 
’•unning  modules  in  the  assigned  processor  (s).  TTMC;  denotes 
the  value  of  the  jth  multicut. 

IV.  MINIMUM  MULTICUT 


Among  the  n  generated  multicuts  locate  the  cut  with  minimum  capacity. 
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I 

^  3.4.2  Example  1.  Two-Processor  Problem 

I  Consider  the  network  of  Fig.  3.5.  The  inter-module 

communication  times  for  this  network  are  shown  in  Table  V.  The 
module-to-processor  communication  times  are  shown  in  Table  VI. 

The  objective  is  to  minimize  the  total  absolute  running  time  of  a 
program  in  the  network.  The  network  in  Fig.  3.6  can  then  be 
constructed  with  nodes  for  each  of  the  modules  and  inter-module 
communication  times  on  the  arc  joining  the  nodes.  In  order  to 
represent  the  processing  time,  two  additional  nodes  are  added  to  the 
network  to  represent  the  two  available  processors  Sj  and  S7 .  The 
running  time  of  each  module  on  processor  Sj  is  denoted  by  the  arc 
joining  that  module  node  to  the  node  .  Similiarly,  running  time  of 
each  module  on  processor  So  is  denoted  on  the  arc  joining  that  module 
node  to  the  S2- 

If  a  max-flow-min-cut  algorithm  [8]  is  performed  on  the  network 
of  Fig.  3.6,  the  cut  shown  by  the  heavy  dark  line  on  Fig.  3.7  is 
obtained.  As  shown  in  Fig.  3.7,  modules  A,  B,  C,  D,  and  E  are 
assigned  to  processor  Sj  while  module  F  is  assigned  to  processor  S2 
(See  Appendix  III  for  computations). 

While  this  method  is  attractive  in  its  simplicity,  it  has  several 
limitations.  The  basic  min-cut  solution  provides  for  a  minimum  time 
assignment  between  two  processors.  In  general,  an  execution  of  this 
method  to  an  arbitrary  number  of  processors  requires  an  N-  dimensional 
j  mini  cut  algorithm,  which  quickly  becomes  computationally  intractable. 

This  limits  the  usefulness  of  the  method  in  many  applications.  An 
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extension  of  this  method  to  allow  assignments  among  three  processors 
or  more  is  now  proposed. 

3.4.2  Example  2.  Three-Processor  Problem 

The  interconnection  graph  for  a  three-processor  and  five  module 
assignment  is  shown  in  Fig.  3.8.  This  network  is  used  with 
undirected  arcs  to  find  the  relationship  between  processors  and 
modules.  Our  research  has  shown  that  the  maximum  flow  algorithm  finds 
an  optimal  position  for  a  three-processor  system. 

Table  VII  shows  the  time  each  module  will  take  with  a  given 
processor.  Therefore,  when  the  program  execution  begins,  the  floating 
modules  will  be  assigned  to  the  processor  which  will  process  where  the 
computation  time  is  minimum.  As  an  example,  the  best  guess  for  module 
A  will  be  processor  Sj  and  for  module  B  will  be  processor  S2 ,  etc. 
The  weight  of  a  branch  is  the  total  time  charged  to  intermodule 
references  represented  by  the  branch. 

Therefore,  if  k  references  between  two  modules  occur  during  the 
running  of  a  program  and  each  reference  takes  t  seconds,  then  when  the 
modules  are  assigned  to  a  particular  processor,  the  weight  of  the 
branch  representing  these  references  is  kt. 
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Since  our  objective  is  to  minimize  the  total  running  time  of  a 
program,  on  a  given  processor,  the  only  time  factor  taken  into 
consideration  is  ill  Table  VII,  which  gives  the  total  running  time  on 
each  processor.  Also,  in  a  distributed  computing  system,  there  is  no 
parallelism  of  module  execution  within  a  program.  Therefore,  Table 
VII  gives  the  total  running  time  of  the  modules  on  their  assigned 
processors.  The  problem  was  set  having  undirected  arcs. 

By  solving  the  maximum  flow  problem  on  the  network  one  gets  the 
minimum  weight  cutset  which  determines  the  module  assignment.  This 
indicates  that  an  optional  assignment  can  be  found  by  running  a 
maximum  flow  algorithm  on  the  network.  The  cutset  will  be  defined 
later. 

At  this  point,  the  dummy  processor  with  infinity  flow  going 
into  two  of  the  processors  is  added.  This  means  that  when  processor 
S3  is  active  the  infinity  flow  from  will  go  to  processor  S2  and 
processor  S3.  Infinity  flow  indicates  that  the  module  cannot  be 
assigned  to  that  particular  processor.  This  was  done  to  each 
processor  in  the  problem  (See  network  in  Figs.  3.9,  3.10,  3.11). 

The  flow  was  minimized  via  the  constraints  related  with  the 
modules  and  its  intermodule  references  represented  by  the  branches. 
When  Sj  was  selected  to  be  the  active  processor  and  the  dummy 
processor  S^  was  connected  to  S2  and  S3  respectively,  the  cut-set  was 
17.  This  includes  the  branches  BA,  ES3 ,  DA,  CA  and  DSj .  (See  Fig. 
3.9  and  computer  print-out  in  Appendix  IV-A). 
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The  same  was  done  to  the  other  processors.  This  time  So  was 
active  and  dummy  processor  was  connected  to  Sj  and  S3  respectively. 
The  cut-set  in  this  case  was  27  and  included  branches  AB,  CB,  DB ,  EB 
and  DS2  (See  Fig.  3.10  and  computer  print-out  in  Appendix  IV-B). 

When  S3  was  active  and  dummy  processor  was  connected  to  Sj  and 
S2  respectively,  the  cut-set  was  26.  The  branches  include  AC,  BC,  DC, 
EC,  DS3  and  ES3  (See  Fig.  3.11  and  computer  print-out  in  Appendix 
IV-C) . 

Now  consider  again  the  example  of  three-processor  and  five  module 
network  shown  in  Fig.  3.8,  with  the  running  times  for  each  processor 
given  in  Table  VII. 

The  linear  programming  model  was  used  to  determine  the  cut-set 
when  all  the  three  processors  were  active.  The  value  of  the  objective 
function  was  found  to  be  38,  which  gave  the  following  cut-set  (See 
Fig.  3.12). 


In  this  cut-set  the  relationship  between  Fig.  3.9  and  Fig.  3.10 
is  shown,  where  the  cut-set  values  were  17  and  27  respectively.  The 
total  of  these  two  cut-sets  equals  44.  Subtracting  the  common  arc 
which  is  AB  with  a  processing  time  of  6  yields  38,  the  present  cut-set 
value.  Similarly  from  Fig  3.9  the  cut-set  value  of  7  minus  the  arc  A3 
of  6  gives  a  value  of  11,  which  when  combined  with  Fig.  3.10  whose 
cut-set  value  of  27  gives  the  present  cut-set  value  of  38. 

Considering  the  above  factors  and  the  distribution  of  modules  to 
processors  In  Fig.  3.12  the  following  assignment  is  the  solution. 
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Module 

A 

to 

Processor 

S1 

Module 

B 

to 

Processor 

*2 

Module 

C 

to 

Processor 

s3 

Module 

D 

to 

Processor 

s3 

Module 

E 

to 

Processor 

s3 

The  linear  programming  model  constructed  was  used  with  the 
LIND0[9]  software  package  to  get  the  above  assignment  solution  (See 
Appendix  IV). 


4.0  SCHEDULING  ALGORITHM 


4.1  Multiprocessor  Scheduling  Algorithm 


Assumptions : 

The  following  are  the  assumptions  in  connection  witli  the 
developemnt  of  the  model. 

1.  The  algorithm  assumes  that  each  module  sends  its  received  message 
forward,  along  a  chosen  link  to  the  processor.  This  means  that 
when  a  module  has  to  be  processed  it  does  to  the  processor  along  a 
link  for  best  execution  time. 

2.  The  processors  in  the  network  are  homogeneous  and  they  are  fully 
connected.  This  means  that  all  the  modules  in  the  network  are 
fully  connected  to  the  processors.  Except  for  any  attached 
modules  which  must  be  processed  by  a  particular  processor,  in 
general  the  modules  could  go  to  any  processor  for  processing. 
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3.  The  message  eventually  arrive  at  their  final  destination,  taking 
into  consideration  the  fastest  computation  time  on  any  given 
processor.  This  means  that,  when  a  module  has  to  be  processed,  it 
should  be  processed  by  the  processor  which  requires  the  minimum 
amount  of  time. 

4.  The  module-to-processor  communication  time  is  known  and  given. 

5.  The  number  of  modules  in  a  network  is,  in  general,  much  higher 
than  the  number  of  processors. 


It  is  noted  in  this  connection  that  the  efficiency  goes  down  by 
increasing  the  number  of  processors,  and  hence  only  three  processors, 
an  80%  efficiency  level  has  been  used.  This  observation  was  based  on 
research  done  at  Digital  on  2  DEC  Systems,  verifying  that  when 
additional  processors  are  included  after  the  third  procesor  to  the 
system,  the  efficiency  of  the  system  goes  down  [2J. 

The  general  procedure  for  the  proposed  heuristic  in  now  defined. 


STEP 

Establish  the  computational  time  constant.  This  constraint 
assumes  that  a  module  will  be  assigned,  only  to  the  processor 
which  takes  the  least  amount  of  time  for  processing,  if  it  is 
not  an  attached  module. 

STEP  2. 


Establish  the  load  balancing  constraint.  In  our  case  we 
assume  that  a  particular  processor  at  any  given  time  should 
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not  process  more  than  10  Million  instructions/sec  and 
less  than  0.5  Million  instruction/sec,  that  is  0.5  Million  < 

Inf ormation/sec<10  Million.  In  the  first  case  it  would  be 
considered  to  be  overloaded  and  in  the  latter 
case  it  would  be  considered  to  be  underloaded. 

STEP  3j_ 

Establish  the  priority  constraint.  In  some  cases  in  a 
network,  some  modules  may  have  to  be  assigned  to  one  particular 
processor  in  order  to  be  processed,  taking  into  consideration 
their  unique  capabilities.  In  this  respect  we  could  first  assign 
those  modules  to  the  particular  processor  and  then  start  from 
Step  4. 

STEP 

Set  up  the  precedence  relationships  among  the  module  to  be 
processed.  This  will  identify  which  module  should  be  processed 
first  for  a  given  program  or  operation. 

STEP  5^ 

For  each  program  or  operation,  create  a  set  of  I  number  of 
modules  through  which  that  program  has  to  pass,  to  be 
processed.  Set  up  the  precedence  relationship(s)  for  each 
program  or  operation  using  the  relationship( s)  established 
in  Step  4. 

i 
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STEP  6. 


In  relation  to  the  overall  performance  of  the  procedure 
discussed  for  this  problem,  one  should  consider  the 
utilization  rate  of  the  functions.  This  means  finding  out 
which  modules  in  the  network  occurs  most.  Then  the  modules 
are  to  be  set  accordingly,  to  carry  out  those  particular 
functions.  Also,  it  is  required  to  utilize  as  many 
modules  as  possible  to  handle  the  most  frequent  functions. 

STEP  7. 

It  is  assumed  that  all  the  available  processors  have  the 
capability  to  process  each  of  the  above  modules  which  occur 
frequently  can  be  assigned  to  any  available  processor  for  the 
fastest  computation  time. 

STEP  8_._ 

Assign  the  set  of  the  I  modules  formed  in  Step  5  to  processors 
so  that  those  modules  could  be  processed  fastest.  Take 
computation  time  on  a  given  processor  into  consideration. 

STEP  9^ 

Make  the  above  assignment  to  separate  processors  so  that  the 
load  balancing  is  maximized,  without  violating  the 
precedence  relationships- 
STEP  LO. 

Choose  the  mth  assignment  (corresponding  to  I  ■  m)  from 
Step  8,  and  if  this  assignment  satisfies  the  load  balancing 
constraint  and  the  computation  time  constraint  along 
with  the  attached  module  constraint  (if  any),  then  stop; 
otherwise  go  to  Step  11. 


42 


STEP  11. 

Identify  the  overloaded  and  underloaded  processors. 

STEP  12. 

Assign  some  modules  from  overloaded  processors  to  underloaded 
processors  taking  Steps  1,  2,  and  3  into  consideration. 

Then  go  to  Step  10. 


The  algorithm  presented  here  provides  a  scheduling  scheme  for  the 
assignment  of  different  modules  to  available  processors,  to  minimize  the 
total  time  on  a  given  processor,  on  a  distributed  system. 


5.0  SUMMARY 

The  problem  of  finding  an  assignment  of  modules  to  processors  to 
minimize  the  total  processing  time,  where  the  module-to  -processor 
communication  time  is  given,  has  been  considered.  The  algorithm  presented 
in  this  report  provides  a  scheme  for  assignment  of  modules  to  processor  in 
a  distributed  computer  system. 

The  model  represents  modules  as  nodes  in  a  graph  and  the  module- 
to-processor  communication  times  as  weights  on  the  undirected  arcs 
connecting  modules  to  processors.  The  methodology  is  to  assign  modules  to 
processors  so  that  total  processing  time  is  minimized.  The  standard 
max-flow-  min-cut  algorithm  [8]  can  obtain  the  required  assignment. 
However,  the  computation  becomes  intractable  for  a  large  number  of 


processors. 


The  proposed  linear  programming  model  minimizes  total  processing  time 
subject  to  resource  limitation  constraints  described  in  Section  4.0. 
Moreover,  a  procedure  has  been  shown  for  handling  additional  modules 
introduced  to  the  network,  taking  the  utilization  rate  into  consideration. 
The  utilization  rate  of  each  function  which  is  nothing  but  a  function  of 
each  module,  can  be  found  simply  by  executing  a  test  run  before  the  initial 
setting  up  of  the  computer  systems.  This  could  also  be  updated 
periodically. 

The  heuristic  models  provide  appropriate  solutions  to  the  module 
assignment  problem.  The  basic  idea  of  heuristic  methods  is  to  find  an 
assignment  for  the  modules  to  processors  and  to  balance  the  load  among  the 
processors.  Load  balancing  is  an  important  problem  in  distributed  computer 
systems,  a  problem  which  is  yet  far  from  being  solved  completely.  Also, 
the  question  remains  how  the  relationship  will  be  expressed  between  the 
precedence  relations  and  the  queuing  delay.  These  are  definitely  important 
problems  requiring  further  research. 
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FS  IE 

-  FS  IF 

3) 

FS1A  f 

FBA  *  FFA  f 

FCA  - 

FAB  - 

FAF  - 

FAC  - 

4) 

FS  IB  - 

FBA  f  FAB  + 

FEB  f 

FDB  - 

FBS2 

•  FBE  - 

5 ) 

FS1C  - 

FCA  f  FAC  + 

FEC  - 

FCS2  - 

FCE 

-  0 

4) 

FS1D  - 

FDB  >  FBD  + 

FED  - 

FDS2  - 

FDE 

=  0 

7) 

FS1E  - 

FEB  +  FBE  - 

FEC  f 

FCE  - 

FED  1 

FDE  • 

8) 

FS  IF  - 

FFA  +  FAF  ~ 

FFS2  = 

0 

9) 

-  F  f  FAS2  +  FBS2  4- 

FCS2  + 

FBS2  f 

FES2 

+  FFS2 

10) 

FS1A  <= 

10 

1  .1 ) 

FS1B  <= 

10 

12) 

FS1C  <= 

4 

13) 

F  SI B  <« 

3 

14) 

FS.1E  <== 

•  12 

15) 

FS1F  <- 

4 

14) 

FAB  <= 

9 

1 7 ) 

FBA  <> 

9 

18) 

FAC  <= 

4 

19) 

FCA  <> 

4 

20) 

FAF  <= 

12 

2 1 ) 

FFA  <= 

12 

22) 

FBD  <  = 

12 

23) 

FDB  <* 

12 

24) 

FBE  <= 

3 

23) 

FEB  <= 

3 

24 ) 

FCE  <- 

1.1 

27) 

FEC  <- 

11 

28) 

FBE  <- 

5 

29) 

FED  <= 

5 

30) 

FAS 2  <» 

5 

31) 

FBS2  <== 

2 

32) 

FCS2  <•- 

4 

33) 

FDS2  <~ 

6 

34) 

FES 2  <- 

5 

35) 

FFS2  <= 

18 

0 

F  AS2  = 
FBD 


FES  2  = 


l-P  OPTIMUM  FOUND  AT  STEP  17 
OBJECTIVE  FUNCTION  VALUE 
38 .OOOOO 


i 


57 


i 

t 

i 


VARIABLE 

F 

FS  LA 

FS  IB 

FS  1C 

FS  ID 

FS  IE 

FS  IF 

FBA 

FFA 

FCA 

FAB 

hAf- 

FAC 

FAS2 

FFB 

FOB 

FBS2 

FBE 

FBD 

FEC 

FCS2 

FCE 

FED 

FDS2 

FDE 

FES2 

FFS2 


VALUE 

38.000000 

10.000000 

9.000000 

4.000000 

3.000000 

8.000000 

4.000000 

7.000000 

0.000000 

0.000000 

0.000000 

ii;ouuouu 

0.000000 

5.000000 

0.000000 

0.000000 

2.000000 

0.000000 

0.000000 

0.000000 

4.000000 

0.000000 

3.000000 

6.000000 

0.000000 

5.000000 

16.000000 


REDUCED  COST 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
1.000000 
0.0000 00 
0 . ocoooo 

V  iWWVUV 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 


ROW 

SLACK 

DUAL  PRICES 

2) 

0.000000 

1.000000 

3) 

0.000000 

1 .000000 

4) 

0.000000 

1 .000000 

5) 

0.000000 

1 .000000 

6 ) 

0.000000 

1 .000000 

7) 

0 .000000 

1 .000000 

8) 

0.000000 

0.000000 

9) 

0.000000 

0.000000 

10) 

0.000000 

0.000000 

1  1 ) 

1.000000 

0.000000 

12) 

0.000000 

0.000000 

1  3 ) 

0.000000 

0.000000 

1  4) 

4.000000 

0.000000 

15) 

0.000000 

1.000000 

1  6 ) 

9.000000 

0.000000 

J  7) 

2.000000 

0.000000 

1  8) 

4.000000 

0.000000 

1.  9 ) 

4.000000 

0.000000 

20  > 

0.000000 

1 .000000 

2  1  ) 

12.000000 

0.000000 

22) 

12.000000 

0.000000 

23) 

12.000000 

0.000000 

1 


I 

I 
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24) 

3.000000 

0.000000 

25) 

3.000000 

0.000000 

2A) 

11 .000000 

0.000000 

27) 

11 .000000 

0.000000 

20) 

5.000000 

0.000000 

29) 

2.000000 

0.000000 

30) 

0.000000 

1 .000000 

31 ) 

0.000000 

1 .000000 

32) 

0.000000 

1 .000000 

33) 

0.000000 

1 .000000 

34) 

0.000000 

1 .000000 

35) 

2.000000 

0.000000 

NO.  ITERATIONS* 

17 

DO  RANGE (SENSITIVITY)  ANALYSIS? 
N 


OI.J  IT 


60 


APPENDIX  IV- A 


'JAN';  PROCESSOR  Sj  IS  ACTIVE  AND  THE  DUMMY  PROCESSOR 
SA  IS  CONNECTED  TO  PROCESSOR  S2  &  S3 
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APPENDIX  IV 

NETWORK  FLOW  APPROACH  TO  THREE  PROCESSOR  PROBLEM 


i 


( 

* 


I 
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.  .  R  LIN'D  OS 
LIN'DO  (  U! 
:  *HETR 
FILE  N‘  A ' !  E  = 
*JAN 
:  *  L  0  0  X 
SO'..' 

>  *  ALE 

MAS  F 

SUBJECT  TO 


3  AUG  7  9  ) 


3) 

F  A  S  1  + 

3) 

FAB  + 

- 

FS  2  3  = 

4) 

FAC  + 

- 

FS  3  C  = 

5) 

FAD  + 

= 

0 

6  ) 

-  F3S  + 

7) 

?  -  F 

S) 

-  F33  2  + 

9) 

-  FCS  3  + 

10) 

-  F  -i-  F 

11) 

FAS1  < 

12) 

“ESI  < 

13) 

FDS1  < 

14) 

FS4S2 

13) 

FS4S3 

1  o  ) 

FAB  <  = 

17) 

F  3  A  <  = 

13) 

FAD  <=» 

19) 

FDA  <  = 

20) 

FAC  <  = 

21) 

F  C  A  <  = 

2  2 ) 

FDS2  <. 

23  ) 

FS  2D  <. 

24) 

FD  B  <  = 

25) 

F  3  D  <  =* 

26) 

FDC  <  = 

27) 

FCD  <  = 

28) 

FDS3  O 

20) 

FS  3D  o 

30  ) 

FC3  <- 

31) 

FBC  <  - 

32  ) 

FC S  3  <  = 

33) 

FS3C  << 

34  ) 

FCE  <  = 

35  ) 

FEC  <  = 

3  6  ) 

F3E  <  = 

2  7  ) 

F  E  3  <  = 

33) 

F  E  S  3  <  > 

39) 

F  S  3  E  <  * 

40  ) 

F  3  S  2  <  » 

4  1  > 

F  5  2  5  <  = 

FB  A  + 
0 

FC  A  - 
0 

FDA  - 

FE3  - 
IS1  - 
F  5  2  3 
F  3  3  C 
!  4  S  2  + 


+■  FAC  - 
FBC  + 

FBC  + 

FCD  4- 

FCE  + 
FDS1  - 

-  FDS2 

-  FDS  3 
FS4S3  = 


FAD  - 
F3D  + 

F3  A  - 
FB  E  + 

FCA  - 
F  3  S  2  - 

FDA  = 
FCS  - 

0 

FDB  -  FEB 

FC  3  +■ 

FCD  + 

FCE  + 

FCS  3  - 

FDC  -  FEC 

FDC  + 

FDS  1 

FD  S  2  + 

FD  S  3 

-  FS  2  D  - 

FEC  + 

FES  1  + 

FES3  - 

FS  3  E 

=  0 

FS  2D  - 
FS  3D  - 
0 


F34S2 
FES  3  - 


0 

FS3E  - 


FS4S3  = 


1000 

3 

1 

1000 

1000 


6 

6 

4 

4 

3 

3 


1000 

1000 


t.:,D 

:  *GO 


1000 

1000 


62 


LP 

1) 

VARIA3L 

F 

FAS  1 
FA3 
FAC 
FAD 
F3A 
FCA 
FDA 
F3C 
F3D 
F3E 
F3S2 
FC3 
FDB 
FEB 
FS  2  3 
FCD 
FCE 
FCS  3 
FDC 
FEC 
FS  3  C 
F3S  L 
FDS2 
FDS  3 
FS  2 D 
F  5  3  3 
FES1 
FES3 
F  3  3  E 
F3432 
F  S  2  S  3 
RON 
2) 

3  ) 

4  ) 

5  ) 
o  ) 

1  ) 

3) 

9) 

;o) 

1  I  ; 

1  2  ) 


3  ) 


:  ) 

i  , 


OPTIMUM  FOUND  AT  STEP  17 

OBJECTIVE  FUNCTION  VALUE 
1 7 . 00000 


VALUE 

REDUCED  COST 

L  7 .000000 

0.000000 

13 .000000 

0  .  oooooo 

0.000000 

1  .  oooooo 

0.000000 

1  .  oooooo 

0.000000 

1  .  oooooo 

6.000000 

0.000000 

3 . 000000 

0.000000 

4 . 000000 

0.000000 

2 . 000000 

0.000000 

0  .  oooooo 

0.000000 

0.000000 

0.000000 

0  .  oooooo 

0.000000 

0.000000 

0.000000 

0.000000 

0.000000 

0.000000 

0  .  oooooo 

3.000000 

0 . oooooo 

0.000000 

0.000000 

0.000000 

0  .  oooooo 

0 . oooooo 

o . oooooo 

0.000000 

0.000000 

0 . 000000 

0 .oooooo 

1 . 00000 0 

0.000000 

1 .oooooo 

0 .oooooo 

0.000000 

0.000000 

o .oooooo 

0.000000 

4.000000 

0  .  oooooo 

L  .000000 

0  .oooooo 

3.000000 

0.000000 

0.000000 

0  .oooooo 

3.000000 

0.000000 

12.000000 

0  .oooooo 

5.000000 

0.000000 

SLACK 

DUAL  PRICES 

0.000000 

1  .oooooo 

0.000000 

0  .oooooo 

0.000000 

0  .  oooooo 

0.000000 

0  .oooooo 

0.000000 

0 .oooooo 

0.000000 

1 . oooooo 

0  .  oooooo 

0  .oooooo 

0.000000 

0  .  oooooo 

0.000000 

0.000300 

037.0000 0 0 

0.000300 

0.000000 

1 . oooooo 

3.000 0 00 

1 .0000 0 0 

0  33.  OOOOOO 

0  .  oooooo 

30  5 . 0  0000') 

0 . oooooo 

•3  .  )  30000 

0 . oooooo 

3  . 1)000  )  0 

l  .  .3000  0  ) 

3 . 0000  )  ) 

) . OOOOOO 

0. OoOOOO 

1  . 3000  )0 
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2  0  ) 

3.000000 

0 . 000000 

21  ) 

0.090000 

i .  o  o  o  o  o  n 

22  ) 

4.000000 

0.000000 

2  3  ) 

0.000000 

0.000000 

24  ) 

7 .000000 

0 . 000000 

25  ) 

7.000000 

0 . 000000 

2  !■< ) 

5.000000 

0 . 000000 

2/) 

5.000000 

0.090000 

23) 

6 .000000 

0.000000 

29) 

5.000000 

0.000000 

30) 

2.000000 

0.00 0 0 0  0 

3  1  ) 

0.000000 

0.000000 

32) 

1000.000000 

0.000000 

33  ) 

999.000000 

0.000000 

34  ) 

6 .000000 

0.000000 

35) 

6.000000 

0 . 000000 

36) 

3.000000 

0.000000 

37) 

3.000000 

0.000000 

33) 

4 . 000000 

0.000000 

39  ) 

1.000000 

0.000000 

40) 

1000 . 000000 

0.000000 

41  ) 

992.000000 

0.000000 

no. 

ITERATIONS-  17 

DO 

RANGE! SENSITIVITY)  ANALYSIS? 

>*::o 
:  *oc:t 

STOP 
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APPENDIX  IV-B 

PROCESSOR  S2  IS  ACTIVE  AND  THE  DUMMY  PROCESSOR 
S4  IS  CONNECTED  TO  PROCESSOR  S1  &  S3 


4 


l 


* 

\ 


I 
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end 

:  *Go 


L? 

OPTIMUM  FOUND  AT  ST 

EP  20 

OBJECTIVE  FUNCTION 

VALUE 

1  ) 

2  7 .00000 

VAR : A3 

LE  VALUE 

REDUCED  COST 

F 

27.000000 

0 . 000000 

.K  A  5  . 

0. 000030 

0.000000 

FAS 

o. 000000 

0.000000 

?  A.  C 

3 . 000000 

0 . 000000 

FA¬ 

4.000000 

0 .000000 

RSI.-. 

13.000000 

0 . 000000 

F  3  A 

0.000000 

1 . 000000 

FC  A 

0.000000 

0 . 000000 

FDA 

0.000000 

0.000000 

F3S2 

23.000000 

0.000000 

FSC 

0.000000 

1 .000000 

FED 

0.000000 

1 . 000000 

F3  2 

0.000000 

1 .000000 

F  C  S 

2.000000 

0 . 000000 

FDE 

7.000000 

0.000000 

FEE 

8.000000 

0 .000000 

FCD 

0.000000 

0.000000 

FCE 

1 .000000 

0 . 000000 

FCS3 

0.000000 

0 . 000000 

FDC 

0.000000 

0 . 000000 

FSC 

0.000000 

0 . 000000 

F  5  3  C 

0.000000 

0 .000000 

FDS2 

4.000000 

0 . 000000 

FDS  2 

0.000000 

0 . 000000 

FD  S  1 

0.000000 

0 .000000 

FS3D 

6.000000 

0.000000 

FSiD 

1 .000000 

0.000000 

FESl 

0.000000 

0.000000 

FES3 

0.000000 

0.000000 

FSi  £ 

3.000000 

0 . 000000 

F33E 

4.000000 

0.000000 

F5 4 3  I 

17.000000 

0.000000 

F  S  4  5  3 

10.000000 

0.000000 

FES4 

0.000000 

0.000000 

R  0  W 

SLACK 

DUAL  PRICES 

2) 

0.000000 

0 . 000000 

3) 

0.000000 

i . 000000 

4) 

0.000000 

0.000000 

5  ) 

0.000000 

0.000000 

6  ) 

0.000000 

0.000000 

7  ) 

0.000000 

0 . 000000 

j ' 

0.000000 

1.000000 

9  ) 

0.000000 

0 . 000000 

1 3 

0 .000000 

0 . 000000 

i  i  i 

25.000000 

0 . 000000 

12  > 

12 .000000 

0.000000 

13) 

0.000000 

0 . 00 0000 

I  4  ) 

• . 0  0 0000 

0 .000000 

15) 

0.000000 

1 . 000000 

< 
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i  3  ' 

6 .000000 

0 .000000 

i  7  ) 

0.00 0 0 00 

0 . 000000 

1  o  ) 

3.0011  o  o  o 

0.000000 

I  9  ) 

2 . 000000 

0.000000 

2  9) 

0 . OOOOJO 

1  •  000000 

2  I  ) 

7.000000 

0.000000 

2  2  i 

0.0  )  0  0  JO 

1  .000000 

2  3 

•3.0000  00 

0.000000 

2  4) 

0.000000 

i  .  0  0  0  0  0  0 

2  5  ) 

5 .000000 

0.000000 

2  o 

5.000000 

0.000000 

2  /  ) 

5 . 000000 

0.000000 

23  ) 

6.000000 

0.000000 

29  ) 

25.000000 

0 .000000 

30  ) 

25 .000000 

0.000000 

3  1  ) 

1 . 000000 

0.000000 

32  ) 

0.000000 

0 . 0  •'1 0  0  0  0 

33) 

34) 

6 . 000000 

0.000000 

0.000000 

0.000000 

35  ) 

4.000000 

0 . 000000 

3  6  ) 

3/) 

0.000000 

0 .000000 

3 . 000000 

0 . 000000 

3  3 ) 

0.000000 

0.000000 

39 ) 

2.000090 

0.090000 

20  ) 

0.090000 

1 .000000 

2  1  ) 

983.000000 

0.00000c 

2  2  ) 

990 . 000000 

0 . 0  0  0  0  0  0 

2  3  ) 

2 . 000000 

0.000000 

:;o.  i t 2 ?„.\t i o ; 
do  ra.oge (St: 

:s=  20 

;sit:vitv)  a v. a  l  v  s  i  s ? 

>*::o 
:  *o'j:t 
sto  ? 


68 


'JANAK' ; 


APPENDIX  IV-C 

PROCESSOR  S3  IS  ACTIVE  AND  THE  DUMMY  PROCESSOR 

S.  IS  CONNECTED  TO  PROCESSOR  S,  &  S„ 

4  12 


< 
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liedo  (uc3aug79) 

:*RSTR 

file  ;;a;ie= 

:  *lco;; 

RO'.; 

>*ALL 


MAX  F 

SUBJECT  TO 


2) 

FAB  -r 

FAC  + 

FAD  -*• 

FAS1  - 

FBA  - 

FCA 

-  FDA  - 

FS1A  =  0 

3) 

-  FA3  + 
FS23  = 

FBA  + 
0 

FBC  + 

FDD  +  F 

BE  + 

FBS2 

-  FCB  - 

FDB  -  FEB 

4) 

-  FAC  + 
o 

F  CA  — 

F3C 

FC3  i-  F 

CD  + 

FCE  + 

FCS3  - 

FDC  -  FEC 

5) 

-  FAD  + 
FS1D  - 

F  DA  ~ 
FS2D  = 

FED  -r 

0 

FD3  -  F 

CD  -r 

F  DC  *** 

FDS1  + 

FDS2  +  FDS3 

6) 

-  FSE  + 

FE3  - 

FCE  -r 

FEC  +  F 

ESI  + 

FES3 

-  F31E 

=  0 

7) 

-  FA31  + 

■  FS1A 

-  FDS1 

-r  FS1D 

-  FE 

SI  + 

FS1E  -  F 

S4S1  =  0 

3) 

9) 

-  F3S2  - 
F  -  F 

FDS2 
CS3  - 

—  "  r  2  J 

FDS3  - 

-  FS23 
FES3  = 

-  FS4S2  = 

0 

0 

10)  -  F  +  FS4S1  +  FS432  »  0 

11)  FA31  <=  25 

I-'  FS1A  <=  25 

13  J  FA3  <=  6 

14)  FBA  <=  6 

15)  FAC  <=  3 

16)  FCA  <=  3 

17 )  FAD  <=  4 

13)  FDA  <-  4 

19)  F3C  <=  2 

20)  FC3  <=  2 

21)  F3D  <=  7 

22)  FDB  <=  7 

23)  FBE  <=  8 

24)  FEB  <=  3 

25)  FBS2  <-  25 

26)  FS23  <=  25 

27)  FCD  <-  5 

23)  FDC  <-  5 

29)  FCE  <=  6 

30)  FEC  O  6 

31)  FDS 1  <«  1 

32)  FS1D  <=  1 

33)  FDS2  <»  4 

34)  F321)  <=  4 

35)  FES  1  <-  3 

36)  FS1E  O  3 

37)  FCS3  <-  25 

38)  FDS 3  <»  6 

39)  FES3  <»  4 


I 
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40)  F34S2  <=  1000 

41)  FS4S1  <=  1000 

e:;d 


:  *C0 


lp  optimum 

foiled  at  ste 

P  IS 

03  JSC 

FIVE  r  UE'CTIOK 

VALUE 

1)  2 

6.00000 

VARIABLE 

VALUE 

REDUCED  COST 

? 

26.000000 

0.000000 

FAB 

0.000000 

0.000000 

FAC 

3.000000 

0.000000 

FAD 

4.000000 

0.000000 

FAS1 

0.000000 

0.000000 

FBA 

0.000000 

0.000000 

FCA 

0.000000 

1.000000 

FDA 

0.000000 

0.000000 

FS1A 

7.000000 

0.000000 

F3C 

2.000000 

0.000000 

FBD 

2 .000000 

0.000000 

F3S 

7.000000 

0.000000 

F3S2 

0.000000 

0.000000 

FCB 

0.000000 

1.000000 

FDB 

0.000000 

0.000000 

FEB 

0.000000 

0.000000 

FCD 

0.000000 

l.OCCOOO 

FCE 

0.000000 

1.000000 

FCS3 

16.000000 

0.000000 

FDC 

5.000000 

0.000000 

FEC 

6.000000 

0.000000 

FDS 1 

0.000000 

0.000000 

FDS2 

0.000000 

0.000000 

FDS3 

6.000000 

0.000000 

FS1D 

1.000000 

0.000000 

<  FS21) 

4.000000 

0.000000 

1  FES1 

0.000000 

0.000000 

|  FES3 

4.000000 

0.000000 

!  F31E 

3.000000 

0.000000 

i  FS4S1 

11.000000 

0.000000 

FS2B 

11.000000 

0.000000 

FS4S2 

15.000000 

0.000000 

ROW 

SLACK 

DUAL  PRICES 

2 ) 

0 . UOOOOO 

0.000000 

3) 

0.000000 

0. UOOOOO 

4) 

0.000000 

1 .000000 

,  5) 

0.000000 

0 . 000000 

6) 

0. 000000 

0.000000 

,  7> 

0.000000 

0.000000 

f*»  rn 
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8) 

9) 

10) 

11) 
7  \ 

;;> 

14) 

15) 

16) 
11) 
13) 

19) 

20) 
81) 
2") ) 

23) 

24) 
23) 
2=) 
21) 
23) 

—  '  i 


-) 
■  \ 

36) 

31) 

38) 

39) 

40) 

41) 


0.000000 
0.000000 
0.000000 
25.000000 
1 3 . 000000 
6.000000 
6.000000 
0.000000 
3.000000 
0.000000 
4 . 000000 
0.000000 
2.000000 
5.000000 
7.000000 
1.000000 
S. 000000 
25 .000000 
14.000000 
5.000000 
0.000000 
6.000000 
0.000000 
1.000000 
0.000000 
-  .  000000 
0.000000 
3.000000 
0.000000 
9.000000 
0.000000 
0.000000 
935.000000 
989.000000 


NO.  ITERATIONS-  IS 


DO  RANGE (SENSITIVITY)  ANALYSIS? 
>*N0 


: "QUIT 


t 


0.000000 
1 .000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
1.000000 
0.000000 
0.000000 
0.000000 
I. 000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
1.000000 
0.000000 
1.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
0.000000 
1.000000 
1.000000 
0.000000 
0.000000 


DATE 

FILMED 


